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Cross-Reference to Other Applications 

The following applications of common assignee are related to the present 
5 application, and are herein incorporated by reference in their entireties: 

"Method and System of Allocating Storage Resources in a Storage Area 
q Network," Ser. No. (to be assigned), Attorney Docket No. 1942.0030000, filed 

I J[ concurrently herewith 

1 ^ "Method, System, and Computer Program Product for Managing Storage 

p 10 Resources," Ser. No. (to be assigned), Attorney Docket No. 1 942.0050000, filed 

o concurrently herewith. 

|= "Method, System, and Computer Program Product for a Data Propagation 

•fl Platform and Applications of Same," Ser. No. (to be assigned), Attorney Docket 

m ' No. 1 942.0070000, concurrently herewith. 

15 Background of the Invention 

Field of the Invention 

The invention relates generally to the field of storage area networks, and 
more particularly to the mirroring of data storage in storage area networks. 

RelatedArt 

2 0 A variety of techniques exist for handling storage of large quantities of 

data, which is typically required for the effective operation of computer systems 
in computer networks. These techniques include host-based and storage-based 
storage management systems. Storage area networks (SANs) have been 



SKGF Ref. No. 1942.0040000 



developed recently as an additional approach to providing access to data in 
computer networks. A SAN is a network linking servers or workstations to 
storage devices. A SAN is intended to increase the pool of storage available to 
each server in the computer network, while reducing the data supply demand on 
servers. 

Data storage, including data stored in SANs, suffers from problems 
associated with data protection. If data is stored at a single location, that data is 
susceptible to loss in the event of damage to the storage device maintaining that 
data. System administrators, and SAN administrators in particular, have to 
confront the difficulty of protecting data on networks that include a large number 
of storage devices, and that may include a wide variety of different storage device 
types and manufactures, communication protocols, and other variations. 

One solution to the problem of data protection is to store redundant copies 
of the data. For example, the data may be stored in a primary storage device, and 
again in a secondary storage device. In this manner, a localized disaster to one 
repository of the data will not affect the redundant copy. The maintenance of 
redundant copies of data presents its own difficulties, however. Such difficulties 
include keeping the redundant stores of data up-to-date in the face of updates to 
the data in primary storage (i.e., data synchronization), among others. 

Therefore, in view of the above, what is needed is a system, method and 
computer program product for maintaining protection of data in a storage area 
network. Furthermore, what is needed is a system, method and computer 
program product for maintaining storage in redundant locations. Still further, 
what is needed is a system, method and computer program product for keeping 
data that is stored in redundant locations up-to-date, or synchronized. 
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Summary of the Invention 

A method, system, and computer program product for mirroring data in 
a computer network is described. At least one connection is established between 
a local storage server and a mirror storage server. A primary storage request is 
5 received from a network host at the local storage server. A mirror storage request 

is sent across the established at least one connection from the local storage server, 
wherein the mirror storage request corresponds to the received primary storage 
request. The mirror storage request is processed at the mirror storage server. A 
first heartbeat signal is sent from the local storage server to the mirror storage 
10 server. A second heartbeat signal is sent from the mirror storage server to the 

local storage server. Data of the received mirror storage request is stored in a 
mirror storage device corresponding to a primary storage device. A response is 
sent across the established at least one connection from the mirror storage server, 
wherein the response indicates whether the storing of data of the received mirror 
15 storage request step was successful. 

In a further aspect of the present invention, an interruption may be 
detected in the second heartbeat signal at the local storage server. The established 
at least one connection is closed. Mirror storage requests are queued. The second 
heartbeat signal is received at the local storage server after the detected 
20 interruption of the second heartbeat signal. The closed at least one connection is 

re-established. The queued mirror storage requests are sent across the re- 
established at least one connection after the re-establishing step. 

In a still further aspect of the present invention a method, system, and 
computer program product for bi-directional mirroring data in a computer 
15 network is described. A first connection between a local storage server and a 

remote storage server is established. A second connection between the local 
storage server and the remote storage server is established. A first local storage 
request from a first network host is received at the local storage server. A first 
local mirror storage request is sent from the local storage server across the first 
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connection, wherein the first local mirror storage request corresponds to the first 
received local storage request. The first local mirror storage request is received 
at the remote storage server. Data received in the first local mirror storage request 
is stored in at least one remote storage device coupled to the remote storage 
server. A first remote storage request is received from a second network host at 
the remote storage server. A first remote mirror storage request is sent from the 
remote storage server across the second connection, wherein the first remote 
mirror storage request corresponds to the received first remote storage request. 
The first remote mirror storage request is received at the remote storage server. 
Data received in the first remote mirror storage request is stored in at least one 
local storage device coupled to the local storage server. 

In a further aspect, a first heartbeat signal is sent from the local storage 
server to the mirror storage server. A second heartbeat signal is sent from the 
remote storage server to the local storage server. An interruption is detected in 
the second heartbeat signal at the local storage server. The established first 
connection is closed. At least a second local storage request is received at the 
local storage server. The at least a second local mirror storage request is queued 
at the local storage server. The second heartbeat signal is received at the local 
storage server after the detected interruption of the second heartbeat signal. The 
closed first connection is re-established between the local storage server and the 
mirror storage server. The queued at least a second local mirror storage requests 
is sent across the re-established first connection after the re-establishing step. 

In a still further aspect, an interruption in the first heartbeat signal is 
detected at the remote storage server. The established second connection is 
closed. The at least a second remote storage request is received at the remote 
storage server. The at least a second remote mirror storage request is queued at 
the remote storage server. The first heartbeat signal is received at the remote 
storage server after the detected interruption of the first heartbeat signal. The 
closed second connection is re-established between the local storage server and 
the mirror storage server. The queued at least a second remote mirror storage 
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requests are sent across the re-established second connection after the re- 
establishing step. 

In a still further aspect, A method, system, and computer program product 
for mirroring data in a computer network is described. A local storage server 
receives a storage request and outputs a mirror storage request. The local storage 
server outputs a first heartbeat signal. A mirror storage server receives a mirror 
storage request. The mirror storage server processes the mirror storage request. 
The mirror storage server outputs a response corresponding to the mirror storage 
request to the local storage server. The mirror storage server outputs a second 
heartbeat signal and receives the first heartbeat signal. The local storage server 
receives the second heartbeat signal. 

Further aspects of the present invention, and further features and benefits 
thereof, are described below. The accompanying drawings, which are 
incorporated herein and form a part of the specification, illustrate the present 
invention and, together with the description, further serve to explain the 
principles of the invention and to enable a person skilled in the pertinent art to 
make and use the invention. 

Brief Description of the Figures 

In the drawings, like reference numbers indicate identical or functionally 
similar elements. Additionally, the left-most digit(s) of a reference number 
identifies the drawing in which the reference number first appears. 

FIG. 1 illustrates an example storage mirror network configuration, 
according to embodiments of the present invention. 

FIG. 2A illustrates an exemplary block diagram for a storage server, 
according to the present invention. 

FIG. 2B illustrates an exemplary block diagram configuration of a local 
storage server and mirror storage server, according to an embodiment of the 
present invention. 
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FIG. 3 illustrates an implementation of a local storage server and a mirror 
storage server prior to the initiation of mirroring, according to an embodiment of 
the present invention. 

FIG. 4 illustrates an implementation of the local storage server and mirror 
storage server of FIG. 3 after the establishment of active storage mirroring, 
according to an embodiment of the present invention. 

FIG. 5 illustrates an example health monitor implementation, according 
to an embodiment of the present invention. 

FIG. 6 illustrates an example implementation of a local storage server for 
processing mirror storage requests, according to an embodiment of the present 
invention. 

FIG. 7 illustrates an example implementation of a mirror storage server 
for processing mirror storage requests, according to an embodiment of the present 
invention. 

FIG. 8 illustrates an example data communication network, according to 
an embodiment of the present invention. 

FIG. 9 shows a simplified five-layered communication model, based on 
an Open System Interconnection (OSI) reference model. 

FIG. 10 shows an example of a computer system for implementing the 



FIG. 11A illustrates an embodiment of a network failure monitor, 
according to an embodiment of the present invention. 

FIG. 1 IB illustrates a mirror storage re-synchronization implementation, 
according to an embodiment of the present invention. 

FIGS. 12A to 12J show flowcharts providing operational steps of 
exemplary embodiments of the present invention. 

FIGS. 13A to 13F show flowcharts providing operational steps of 
exemplary bi-directional mirroring embodiments of the present invention. 



20 



present invention. 
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The present invention will now be described with reference to the 
accompanying drawings. 

Detailed Description of the Preferred Embodiments 
Overview 

The present invention is directed to a method and system that provides for 
data mirroring. Data mirroring refers to the storage of identical data in more than 
one location. Data mirroring represents a way to protect data from localized 
disasters by retaining a redundant copy of primary storage data in a remote, 
secondary storage location. Data mirroring may be accomplished over long 
distances using TCP/IP over Ethernet, Tl, T3, Gigabit Ethernet, or ATM. The 
present invention is particularly applicable to computer networks, including 
storage area networks, that include hosts, servers, and storage subsystems. 

The present invention is further directed to a method and system for 
detecting network failures in a mirror storage configuration, and ensuring that 
data mirroring reliably occurs. The present invention detects when a network 
interruption occurs between primary and secondary storage. The present 
invention queues mirror storage requests during the period of interruption. When 
the network is restored, the queued mirror storage requests are executed, and the 
mirror storage configuration continues normal operation. 

FIG. 1 illustrates an example storage mirror network configuration 100, 
according to embodiments of the present invention. Storage mirror network 
configuration 1 00 comprises local servers 1 02, mirror servers 1 04, a local storage 
server 1 06, a mirror storage 1 08, a primary storage 1 1 0, and a secondary storage 
1 12. Generally, local storage server 106 receives data I/O requests from local 
servers 102, and issues the data I/O requests to primary storage 110. Local 
storage server 1 06 also forwards mirror data I/O requests to mirror storage server 
108. Mirror storage server 108 issues the mirror data I/O requests to secondary 
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storage 1 12. Mirror storage server 108 sends a reply or response to local storage 
server 1 06 that includes information, such as whether the mirror data I/O request 
was successfully executed in secondary storage 1 12. Mirror servers 104 may be 
used as file servers for secondary storage 112. 

Local servers 102 and mirror servers 104 each include one or more hosts 
or servers that may be present in a data communication network. Servers 1 02 and 
104 manage network resources. For instance, one or more of the servers of 
servers 102 and 104 may be file servers, network servers, application servers, 
database servers, or other types of server. Servers 102 and 104 may comprise 
single processor or multi-processor servers. Mirror servers 104 may also be 
referred to as remote servers. Servers 102 and 104 process requests for data and 
applications from networked computers, workstations, and other peripheral 
devices otherwise known or described elsewhere herein. Servers 102 and 104 
output requests to local storage server 106 and mirror storage server 108, 
respectively, to write to, or read data from primary storage 110 and secondary 
storage 112. 

Local storage server 106 receives storage read and write requests from 
local servers 102. Local storage server 106 outputs the storage read and write 
requests to the physical storage devices in primary storage 1 10. Local storage 
server 106 also sends storage read and write requests to mirror storage 108. In 
particular, local storage server 106 sends storage write requests to mirror storage 
server 108 so that mirror storage server 108 may update data stored in secondary 
storage 1 1 2 to match or mirror that of primary storage 1 1 0. Local storage server 
1 06 sends these storage requests to mirror storage server 1 08 as they are received. 
This allows for substantially concurrent data storage in primary storage 1 10 and 
secondary storage 112 at any given time. Mirror storage server 108 sends a 
response to local storage server 106 to indicate whether the data was written to 
secondary storage 112 successfully. Mirror storage server 108 may also be 
referred to as a remote storage server. Local storage server 106 and mirror 
storage server 108 each also monitor the connection between them for network 
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outages. If the connection to the opposite server is lost, mirror storage requests 
are queued by local storage server 106, and/or responses are queued by mirror 
storage server 108. The queued requests and/or responses are forwarded to the 
respective server when the network is restored. 

Primary storage 1 10 and secondary storage 1 12 receive storage read and 
write requests from local storage server 106 and mirror storage server 108, 
respectively. Primary storage 1 10 and secondary storage 1 12 route the received 
storage read and write requests to the corresponding storage device(s), which 
respond by reading or writing data as requested. Primary storage 110 and 
secondary storage 112 each comprise one or more storage devices that may be 
directly coupled to local storage server 106 or mirror storage server 108, 
respectively, and/or may be interconnected in storage area network configurations 
that are coupled to local storage server 106 and mirror storage server 108. For 
example, primary storage 1 10 and secondary storage 1 12 may each comprise one 
or more of a variety of storage devices or LUNs (logic unit numbers), including 
tape systems, JBODs (Just a Bunch Of Disks), floppy disk drives, optical disk 
drives, disk arrays, and other applicable types of storage devices otherwise known 
or described elsewhere herein. Storage devices in primary storage 110 and 
secondary storage 1 12 may be interconnected via SCSI or fibre channel links, or 
other types of links, in a variety of topologies. Example topologies for primary 
storage 110 and secondary storage 112 are more fully described below. 
Secondary storage 112 may also be referred to as remote storage, or redundant 
storage. 

In further embodiments of the present invention, mirror storage server 1 08 
may operate as a primary storage server that mirrors storage requests. In other 
words, mirror storage server 108 may operate as a local storage server for mirror 
servers 104 and secondary storage 1 12, for purposes of mirroring data stored in 
secondary storage 1 12. For example, in embodiments, mirror storage server 1 08 
receives data I/O requests from mirror servers 104, and issues the data I/O 
requests to secondary storage 1 1 2. In such a configuration, secondary storage 1 1 2 
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operates as primary storage. Mirror storage server 108 forwards mirror data I/O 
requests to local storage server 106. In such a configuration, local storage server 
106 operates as a mirror storage server. Local storage server 106 issues the 
mirror data I/O requests to primary storage 110. Local storage server 106 sends 
a reply or response to mirror storage server 108 that includes information, such 
as whether the mirror data I/O request was successfully executed in primary 
storage 110. 

In other words, in embodiments, local storage server 106 can mirror 
storage requests received from local servers 102, while mirror storage server 108 
simultaneously mirrors storage requests received from mirror servers 104. 
Secondary storage 112 acts as redundant storage for local servers 102, while 
primary storage 110 acts as redundant storage for mirror servers 104. For 
instance, half of the storage space of primary storage 1 10 may be allocated as 
primary storage for local servers 102, and half may be allocated as redundant 
storage for mirror servers 104. Likewise, half of the storage space of secondary 
storage 1 12 may be allocated as primary storage for mirror servers 104, and half 
may be allocated as redundant storage for local servers 1 02. In embodiments, the 
primary and redundant storage space allocations may or may not be divided 
equally in primary storage 110 and/or in secondary storage 112, and can be 
divided according to any proportions. 

Operation of mirror storage server 108 as a local storage server 106 is 
analogous to operation of local storage server 106 as such, as described herein. 
The discussion herein focuses primarily on operation of local storage server 106 
as the primary storage server and mirror storage server 108 as the redundant 
storage server. However, the discussion of local storage server 106 as a primary 
storage server is also applicable to the operation of mirror storage 108 as a 
primary storage server, and the discussion of mirror storage server 108 as the 
redundant storage server is applicable to the discussion of local storage server 1 06 
as the redundant storage server. 
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FIG. 2B illustrates an exemplary block diagram configuration of local 
storage server 106 and mirror storage server 108, according to an embodiment of 
the present invention. Local storage server 1 06 comprises a mirror request sender 
220 and a first network failure monitor 230. Mirror storage server 1 08 comprises 
a mirror request receiver 240 and a second network failure monitor 250. 

Mirror request sender 220 receives data storage requests from local 
servers 102, and forwards them to mirror request receiver 240 for mirror storage 
by mirror storage server 1 08, via connection 260. For instance, a polling process 
or thread in local storage server 106 may poll for storage requests from local 
servers 102. The polling process may then forward the storage requests to 
subsequent worker threads or processes in local storage server 106. The worker 
threads or processes perform the actual write to primary storage 1 1 0, and forward 
the storage requests to mirror request sender 220 for mirroring. 

Mirror request receiver 240 receives the mirror storage requests, and 
forwards them to secondary storage 1 12 for storage. Mirror request receiver 240 
may receive an indication from secondary storage 112 whether the storage 
operation was successful, and then send this indication to mirror request sender 
220 as a reply or response via connection 260. 

First network failure monitor 23 0 and second network failure monitor 250 
each generate a health signal, which they forward to each other via second path 
270. The health signal output by first network failure monitor 230 allows local 
storage server 1 06 to notify mirror storage server 1 08 that local storage server 1 06 
is operating. Hence, mirror request sender 220 may send mirror storage requests 
to, and receive responses from mirror request receiver 240. The health signal 
output by second network failure monitor 250 allows mirror storage server 108 
to communicate to local storage server 106 that it is operating, and that mirror 
request receiver 240 is ready to receive mirror storage requests, and to send 
responses. 

In the event of a network interruption, one or both of first and second 
network failure monitors 230 and 250 may not receive a health signal from the 
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other network failure monitor. If first network failure monitor 230 does not 
receive a health signal, it will indicate to mirror request sender 220 to stop 
sending mirror storage requests to mirror request receiver 240, and to begin 
queuing the requests. If second network failure monitor 250 does not receive a 
health signal, it will indicate to mirror request receiver 240 to stop sending 
responses, and to begin queuing them. Once a health signal is reestablished, the 
respective network failure monitor will signal to mirror request sender 220 or 
mirror request receiver 240 to send the mirror storage requests or responses in the 
respective queues, and to begin to send new mirror storage requests, or responses, 
respectively. 

The present invention can be implemented in and operated from a SAN 
device or appliance that interfaces between the hosts and the storage subsystems 
comprising the SAN. For example, local storage server 106 may be a first SAN 
device or appliance that is an interface between local servers 102 and primary 
storage 110. Mirror storage server 1 08 may be a second SAN device or appliance 
that is an interface between mirror servers 104 and secondary storage 112. 

The invention has the advantage of being completely host (operating 
system) independent and storage system independent. Conventional data 
mirroring functionality is primarily found on hosts or storage subsystems, not on 
independent network servers that manage a storage network. As a result, remote 
data mirroring for risk reduction purposes requires a large investment in storage 
array hardware, or in cumbersome host software, as well as causing a significant 
performance penalty. 

Unlike conventional host-based remote mirroring solutions, the present 
invention does not require additional host software, and is compatible with all 
hosts. The invention is completely transparent so there is no need to manage 
mirroring from individual hosts. Instead, for example, an Internet or other 
network accessible graphical user interface may centrally manage mirroring for 
the entire SAN. 

Unlike storage array-based mirroring solutions, the present invention is 
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not tied to a specific storage vendor, and operates with any type of attached 
storage. For example, the present invention operates with fibre channel storage 
devices. The invention obviates the need to invest in dedicated storage hardware 
for the sole purpose of remote mirroring. 

Description in these terms is provided for convenience only. It is not 
intended that the invention be limited to application in this example environment. 
In fact, after reading the following description, it will become apparent to a 
person skilled in the relevant art how to implement the invention in alternative 
environments known now or developed in the future. Further detailed 
embodiments of the elements of storage mirror network configuration 100 are 
discussed below. 

Terminology related to the present invention is described in the following 
section. Embodiments for the local storage server and mirror storage server are 
then provided. Establishment of the mirroring system is discussed in the 
subsequent section. The mirroring of data is then presented, followed by 
configuration of the system heartbeat. This is followed by example operational 
embodiments of the present invention. Exemplary storage area network 
implementations of the storage mirror of the present invention are then presented, 
followed by discussion of application of the present invention in a SAN 
appliance. Finally, an exemplary computer system in which the present invention 
may be implemented is described. 

Terminology 

To more clearly delineate the present invention, an effort is made 
throughout the specification to adhere to the following term definitions as 
consistently as possible. 

Arbitrated A shared lOOMBps Fibre Channel transport supporting up to 
Loop 126 devices and 1 fabric attachment. 
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Fabric 
HBA 
Hub 
Initiator 
JBOD 
LAN 



Logical 
Unit 



LUN 

Point-to- 
point 

Private 
loop 

Public loop 

RAID 

SCSI 

SCSI-3 
Storage 



One or more Fibre Channel switches in a networked 
topology. 

Host bus adapter; an interface between a server or 
workstation bus and a Fibre Channel network. 

In Fibre Channel, a wiring concentrator that collapses a loop 
topology into a physical star topology. 

On a Fibre Channel network, typically a server or a worksta- 
tion that initiates transactions to disk or tape targets. 

Just a bunch of disks; typically configured as an Arbitrated 
Loop segment in a single chassis. 

Local area network; a network linking multiple devices in a 
single geographical location. 

The entity within a target that executes I/O commands. For 
example, SCSI I/O commands are sent to a target and 
executed by a logical unit within that target. A SCSI physical 
disk typically has a single logical unit. Tape drives and array 
controllers may incorporate multiple logical units to which 
I/O commands can be addressed. Typically, each logical unit 
exported by an array controller corresponds to a virtual disk. 

Logical Unit Number; The identifier of a logical unit within 
a target, such as a SCSI identifier. 

A dedicated Fibre Channel connection between two devices. 



A free-standing Arbitrated Loop with no fabric attachment. 

An Arbitrated Loop attached to a fabric switch. 

Redundant Array of Independent Disks. 

Small Computer Systems Interface; both a protocol for 
transmitting large blocks of data and a parallel bus 
architecture. 

A SCSI standard that defines transmission of SCSI protocol 
over, serial links. 

Any device used to store data; typically, magnetic disk media 
or tape. 
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Switch A device providing full bandwidth per port and high-speed 
routing of data via link-level addressing. 

Target Typically a disk array or a tape subsystem on a Fibre 
Channel network. 

TCP Transmission Control Protocol; TCP enables two hosts to 
establish a connection and exchange streams of data; TCP 
guarantees delivery of data and also guarantees that packets 
will be delivered in the same order in which they were sent. 

Topology The physical or logical arrangement of devices in a 
networked configuration. 

UDP User Datagram Protocol; a connectionless protocol that, like 
TCP, runs on top of IP networks. Unlike TCP/IP, UDP/IP 
provides very few error recovery services, offering instead a 
direct way to send and receive datagrams over an IP network. 

WAN Wide area network; a network linking geographically remote 
sites. 

Local and Mirror Server Embodiments 



Structural implementations for the mirror storage configuration of the 
present invention are described at a high-level and at a more detailed level. These 

10 structural implementations are described herein for illustrative purposes, and are 

not limiting. In particular, the present invention as described herein can be 
achieved using any number of structural implementations, including hardware, 
firmware, software, or any combination thereof. For instance, the present 
invention as described herein may be implemented in one or more computer 

15 systems, application-specific boxes, or other devices. In an embodiment, the 

present invention may be implemented with a pair of SAN appliances, which 
provides for an interface between host servers and storage. Such SAN appliances 
include the SANLink™ appliance, developed by StorageApps Inc., located in 
Bridgewater, New Jersey. 
20 FIG * 2A illustrates an exemplary block diagram for local storage server 

1 06, according to embodiments of the present invention. Local storage server 1 06 
comprises mirror request sender 220 and network failure monitor 230. Mirror 
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request sender comprises a worker thread generator module 202 and a connection 
thread generator module 204. First network failure monitor 230 comprises a 
heartbeat thread generator module 206. These modules may be implemented in 
software, hardware, firmware, or any combination thereof. 

The block diagram embodiment for local storage server 106 shown in 
FIG. 2 A is also substantially applicable to mirror storage server 108 of FIG. 2B. 
In an embodiment, mirror request receiver 240 comprises a worker thread 
generator module 202 and a connection thread generator module 204, and second 
network failure monitor 250 comprises a heartbeat thread generator module 206. 
These modules in mirror storage server 108 are substantially structurally and 
operationally similar to those in local storage server 106. The discussion below 
focuses mainly on these modules in local storage server 106, but the discussion 
is also applicable to these modules in mirror storage server 108. 

In embodiments, local storage server 106 and mirror storage server 108 
perform mirroring functions using these modules. These modules are described 
in more detail as follows. 

As shown in FIG. 2A, worker thread generator module 202 generates n 
worker threads 208. In an embodiment, a worker thread is an element of a 
process for performing specific local processing functions. For example, each of 
the n work threads 208 may process a SCSI-based IP Mirroring request from a 
host, and send the results back to that host. Each of the n worker threads 208 may 
execute instructions concurrently with other threads. The number n of worker 
threads generated is determined on an application basis, and may be any number. 

Connection thread generator module 204 comprises a connection array 
216. Connection array 216 comprises an array of n of elements. Each element 
of connection array 216 corresponds to one of the n worker threads 208. In an 
embodiment, each of the n elements includes a socket, a timestamp parameter, 
and other information, such as a status parameter. The functions of these 
elements will be further described below. 
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Connection thread generator module 204 generates n connection threads 
210. Each thread of n connection threads 210 corresponds to one of the n worker 
threads 208. Each thread of n connection threads 210 is generated according to 
information stored in the corresponding element of connection array 216. In an 
embodiment, the n connection threads 210 in mirror request receiver 240 are 
generated to receive storage mirroring requests from n worker threads 208 of 
mirror request sender 220. The n connection threads 2 1 0 in mirror request sender 
220 are generated to receive storage mirroring requests from n worker threads 208 
of mirror request receiver 240. The n connection threads 210 accept the mirror 
storage requests from their corresponding n worker threads 208. The functions 
of n connection threads 210 will be further described below. 

Heartbeat thread generator module 206 generates a heartbeat sender thread 
212 and a heartbeat receiver thread 214. Both heartbeat sender thread 212 and 
heartbeat receiver thread 2 1 4 are dormant if the mirroring function is not engaged. 
When the mirroring function is engaged, heartbeat sender thread 212 generates 
a heartbeat signal, which is received by heartbeat receiver thread 214 on the 
opposite server. Heartbeat sender thread 212 and heartbeat receiver thread 214 
allow the storage mirror to determine whether a remote network (i.e., local 
storage server 106, mirror storage server 108, or physical network) is "alive." 
The functions of heartbeat sender thread 212 and heartbeat receiver thread 214 
will be further described below. 

The embodiments for the storage mirror of the present invention described 
above are provided for purposes of illustration. These embodiments are not 
intended to limit the invention. Alternate embodiments, differing slightly or 
substantially from those described herein, will be apparent to persons skilled in 
the relevant art(s) based on the teachings contained herein. 
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Establishing the Mirroring System of the Present Invention 

FIG. 3 illustrates an implementation of local storage server 1 06 and mirror 
storage server 108 . prior to the initiation of mirroring, according to an 
embodiment of the present invention. Essentially, FIG. 3 illustrates the invention 
in a "dormant" mirroring state. In FIG. 3, local storage server 106 comprises n 
worker threads 208a, connection array 216a, heartbeat sender thread 212a, and 
heartbeat receiver thread 214a. Mirror storage server 108 comprises n worker 
threads 208b, connection array 2 1 6b, heartbeat sender thread 2 1 2b, and heartbeat 
receiver thread 214b. 

FIG. 4 illustrates an implementation of local storage server 1 06 and mirror 
storage server 108 of FIG. 3 after the establishment of active storage mirroring, 
according to an embodiment of the present invention. FIG. 4 illustrates an 
embodiment where mirroring of data may occur in both directions. In other 
words, local storage server 106 may send mirror storage requests from local 
servers 102 to mirror storage server 108, and mirror storage server 108 may send 
mirror storage requests from mirror servers 104 to local storage server 106. The 
present invention is applicable to one-way and two-way mirroring, and further 
configurations. 

In a preferred embodiment, when a storage mirror is defined, local 
storage server 106 establishes n first TCP (Transmission Control Protocol) 
connections 402 with mirror storage server 108. Connection array 216a 
comprises n elements 410a. The n first TCP connections 402 are established by 
connecting each socket stored in the n elements 410a of connection array 216a 
with one of n connection threads 210b. Mirror storage server 108, in turn, 
establishes n second TCP connections 404 with local storage server 106 by 
connecting each socket stored in the n elements 410b of connection array 216b 
with one of n connection threads 210a. In a preferred embodiment, storage 
requests are sent, and responses are received according to Internet protocol (IP). 
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Connection protocols other than TCP, and data protocols other than IP are also 
within the scope and spirit of the present invention. 

Once n first and n second TCP connections 402 and 404 are established, 
local storage server 106 and mirror storage server 108 operate to mirror storage 
requests. This mirroring function is described further in the following section. 

In a preferred embodiment, when health monitor functionality is engaged, 
heartbeat sender threads 212a and 212b send a UDP (User Datagram Protocol) 
message on a first heartbeat signal 406 and a second heartbeat signal 408, 
respectively, at regular intervals, to the heartbeat receiver threads 2 14b and 2 1 4a, 
respectively, on the opposite server. This allows the opposite server to determine 
whether the heartbeat sending server is active. UDP has the advantage of not 
requiring a constant signal connection between sender and receiver. Other 
applicable message protocols are also within the scope and spirit of the present 
invention. 

Mirroring Data According to the Present Invention 

After the IP Mirror connections are established, the worker threads on the 
local server process mirroring requests from hosts. For instance, the worker 
threads may process SCSI-based IP mirroring requests from the host. FIG. 6 
illustrates an example implementation of local storage server 106 for processing 
mirror storage requests, according to an embodiment of the present invention. 
The discussion below in reference to FIG. 6 and FIG. 7 focuses on one-way, or 
unidirectional storage mirroring, where local storage server 106 sends mirror 
storage requests from local servers 1 02 to mirror storage server 1 08. The present 
invention as described herein, however, is also applicable to two-way, or 
bidirectional storage mirroring, where mirror storage server 1 08 also sends mirror 
storage requests from mirror servers 104 to local storage server 106. 

In FIG. 6, a local worker thread m 602 receives a storage request 608. 
Storage request 608 includes data and storage location information for storing the 
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data in storage. Local worker thread m 602 outputs a mirror storage request for 
storage request 608 on a first TCP connection m 612. First TCP connection m 
612 was established by connecting the socket stored in an element m 606 of 
connection array 2 1 6a with a connection thread operating in mirror storage server 
108. 

FIG. 7 illustrates an example implementation of mirror storage server 1 08 
for processing mirror storage requests, according to an embodiment of the present 
invention. FIG. 7 shows a mirror connection thread m 704 that receives first TCP 
connection m 612. In an embodiment, mirror connection thread m 704 processes 
the mirror request storage received on first TCP connection m 6 1 2, by storing the 
data affiliated with storage request 608 in mirror or secondary storage. Mirror 
connection thread m 704 sends the data affiliated with storage request 608 to 
secondary storage via mirror storage request 708. 

Mirror connection thread m 704 times out periodically, if mirror requests 
are not received on first TCP connection m 612 after a predetermined amount of 
time. When this happens, mirror connection thread m 704 checks an element m 
706 in connection array 216b to determine whether first TCP connection m 612 
has been closed. In an embodiment, element m 706 includes a status parameter. 
The status parameter indicates whether first TCP connection m 612 is closed or 
open. The status parameter may be modified by a heartbeat sender thread 2 1 2 to 
indicate a closed or open condition for first TCP connection m 612, as described 
in the text below with respect to FIG. 5. If mirror connection thread m 704 
determines from the status parameter in element m 706 that first TCP connection 
m 612 is closed, mirror connection thread m 704 closes its connection and exits. 
If mirror connection thread m 704 determines that first TCP connection m 612 is 
open, mirror connection thread m 704 will reinitiate monitoring of first TCP 
connection m 6 1 2 . 

Mirror connection thread m 704 receives a response via mirror storage 
request 708 from secondary storage as to whether the data affiliated with storage 
request 608 was successfully mirrored, or stored in secondary storage. Mirror 
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connection thread m 704 sends this response to local storage server 106 on first 
TCP connection m 612. 

The response from mirror storage server 108 is received by local worker 
thread m 602 on first TCP connection m 6 12. Local worker thread m 602 outputs 
the response as response 610. 

Worker threads can process mirroring requests both asynchronously and 
synchronously. In an asynchronous mirroring mode, local worker thread m 602 
processes the request to store data to primary storage (for example, a SCSI 
request), sends the result to the requesting host, and then waits for a response 
from mirror storage server 108 as to the success of mirror storage of the data. In 
a synchronous mirroring mode, local worker thread m 602 processes the request 
to store data to primary storage, and waits for a response from mirror storage 
server 108 before sending the results of the primary storage request to the host. 

Description of data mirroring in these terms is provided for convenience 
only. It is not intended that the invention be limited to application in this example 
environment. In fact, after reading the description herein, it will become apparent 
to a person skilled in the relevant art how to implement the invention in 
alternative environments known now or developed in the future. 

Network Failure Detector/Health Monitor of the Present Invention 

FIG. 5 illustrates an example health monitor implementation in local 
storage server 106, according to an embodiment of the present invention. The 
health monitor implementation for local storage server 106 of FIG. 5 is also 
applicable to mirror storage server 108. In embodiments, the health monitor 
implementations for local storage server 106 and mirror storage server 108 
interact to detect network failures and to monitor the health of the complete 
storage mirror implementation. 

In FIG. 5, heartbeat sender thread 212 sends messages on first heartbeat 
signal 406 to the heartbeat receiver thread at the opposite server, at predetermined 
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time intervals. For example, in an embodiment, heartbeat sender thread 2 1 2 may 
send UDP messages on first heartbeat signal 406 every 1.5 seconds. After 
sending a UDP message, heartbeat sender thread 212 checks each element of 
connection array 216. In FIG. 5, connection array 216 includes elements 410. If 
the timestamp parameter of an element of connection array 216 is "old", i.e., 
contains a value of time greater than a predetermined time amount, a heartbeat 
sender thread 212 enters the status parameter of the element as "down" and 
closed. For example, in an embodiment, connection array 216 may indicate an 
element as down and closed if the timestamp parameter is older than 4 seconds. 
Other time amounts are applicable to the present invention. 

Heartbeat receiver thread 214 receives second heartbeat signal 408 from 
the heartbeat sender thread at the opposite server at the above described 
predetermined time intervals. In an embodiment, heartbeat receiver thread 214 
remains dormant while it waits for a UDP message from the heartbeat sender 
thread of the other server. When a UDP message arrives on second heartbeat 
signal 408, heartbeat receiver thread 214 updates or resets the timestamp 
parameter on each element of connection array 216. 

As described in the previous section with respect to FIGS. 6 and 7, a 
connection thread times out if a request from the opposite server has not arrived. 
Upon such a timeout, the connection thread checks the status parameter of the 
corresponding element of connection array 216 to see if the connection has been 
marked down. If so, the connection thread closes its connection and exits. 

After a connection breaks (e.g., due to a disaster affecting a server or 
network), local storage server 106 and/or mirror storage server 108 no longer 
receives UDP messages from the other server, which causes the timestamp 
parameters of each element of connection array 2 1 6 to expire. Once this occurs, 
(a) the heartbeat sender thread of the operating server closes all connections in 
connection array 216 , (b) the connection threads of the operating server close the 
corresponding connections and exit, and (c) the heartbeat sender thread of the 
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operating server continues to send UDP messages to the opposite server at the 
rate of the predetermined time interval. 

In a preferred embodiment, after such an interruption, when heartbeat 
receiver thread 214 on the operating server receives the next UDP message, it 
starts a timer corresponding to a probationary interval. If the heartbeat receiver 
thread 214 continues to receive UDP packets during the duration of the 
probationary interval without interruption, the operating server reestablishes all 
connections in connection array 216. To reestablish the connections, each status 
parameter is modified to indicate the connection is "up" or open, and the 
connections are considered to be operating again. If the heartbeat receiver 
experiences an interruption during the probationary interval, the timer is restarted 
upon receipt of the next UDP message. For example, in an embodiment, 
heartbeat receiver thread 214 may set the probationary interval to a value of 10 
seconds. The probationary interval effectively prevents the reestablishment of 
connections until the network has reached a stable state. In alternative 
embodiments, connections may be reestablished upon the receipt of the first UDP 
message, without a probationary interval. 

In an embodiment, a local worker thread can determine that its 
corresponding connection is down by checking the status parameter of the 
corresponding element of connection array 216 to see if the connection has been 
marked down. For instance, the local worker thread may check the status 
parameter of the corresponding element of connection array 216 before 
attempting to send a mirror storage request. 

During such an interruption, mirror storage requests received by local 
storage server 106 must be "journaled" or queued, to be later replayed when the 
interruption is over. FIG. 11B illustrates a mirror storage re-synchronization 
implementation, according to an embodiment of the present invention. 

As shown in FIG. 1 IB, during an interruption, n worker threads 208 send 
mirror storage requests into a mirror storage request journal 1104 as journal 
storage requests 1110. Mirror storage request journal 1 1 04 maintains a record of 
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all mirror storage requests that occur during an interruption. In an embodiment, 
journal storage requests 1 1 10 are maintained in separate journals on a LUN-by- 
LUN basis, as shown in FIG. 1 IB. For example, mirror storage requests relating 
to LUN 0 are maintained in first LUN journal 1 106. Mirror storage requests 
relating to LUN 1 are maintained in second LUN journal 1 108. Any number of 
subsequent journals may exist for subsequent LUNs. Journal storage requests 
1110 may be maintained and organized in other fashions, such as on a worker 
thread-by-worker thread basis, etc. 

After the interruption is over, journal mirror storage requests must be 
replayed. FIG. 1 1 A illustrates an embodiment of network failure monitor 230, 
according to an embodiment of the present invention. Heartbeat thread generator 
module 206 in network failure monitor 230 may generate re-synchronization 
threads 1 102. Re-synchronization threads 1 102 are used to re-play the mirror 
storage requests stored in the journals of mirror storage request journal 1 1 04. As 
shown in FIG. 1 IB, re-synchronization threads 1 102 read the journals of mirror 
storage request journal 1 104, and send the journaled storage requests to mirror 
storage server 108 across first TCP connections 402. Any number of re- 
synchronization threads may be generated, as is required by the particular 
application. Re-synchronization threads in mirror storage server 108 replay 
journaled mirror storage requests in a similar fashion. 

Description of the network failure detector/health monitor of the present 
invention in these terms is provided for convenience only. It is not intended that 
the invention be limited to application in this example environment. In fact, after 
reading the description herein, it will become apparent to a person skilled in the 
relevant art how to implement the invention in alternative environments known 
now or developed in the future. 
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Operational Embodiments of the Present Invention 

FIG. 12A shows a flowchart 1200 providing operational steps of an 
example embodiment of the present invention. FIGS. 12B-J show operational 
steps of further embodiments of the present invention. The steps of FIGS. 12 A- J 
may be implemented in hardware, firmware, software, or a combination thereof. 
For instance, the steps of FIGS. 12 A- J may be implemented by local storage 
server 106 and mirror storage server 108. Furthermore, the steps of FIGS. 12A-J 
do not necessarily have to occur in the order shown, as will be apparent to 
persons skilled in the relevant art(s) based on the teachings herein. Other 
structural embodiments will be apparent to persons skilled in the relevant art(s) 
based on the discussion contained herein. These steps are described in detail 
below. 

The process begins with step 1202. In step 1202, at least one connection 
is established between a local storage server and a mirror storage server. In an 
embodiment, the one or more connections between the local storage server and 
the mirror storage server are TCP connections. Other connection protocols are 
also within the scope and spirit of the invention. For example, in an embodiment, 
the one or more connections are included in first TCP connections 402, from local 
storage server 1 06 to mirror connection threads 2 1 0b in mirror storage server 108. 

In step 1 204, a primary storage request is received from a network host at 
the local storage server. For example, in an embodiment, the primary storage 
request is received from local servers 1 02 by local worker thread m 602 as storage 
request 608. 

In step 1 206, a mirror storage request is sent across the established at least 
one connection from the local storage server to the mirror storage server, wherein 
the mirror storage request corresponds to the received primary storage request. 
For example, in an embodiment, the mirror storage request is sent by local worker 
thread m 602 on first TCP connection /w 612 from local storage server 106. 
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For example, in an embodiment, the mirror storage request is received by 
mirror connection thread m 704 in mirror storage server 108. 

In step 1208, the mirror storage request is processed at the mirror storage 
server. For example, in an embodiment, mirror connection thread m 704 
undertakes the processing of the mirror storage request. 

In step 1210, a first heartbeat signal is sent from the local storage server 
to the mirror storage server. For example, the first heartbeat signal is first 
heartbeat signal 406. 

In step 1212; a second heartbeat signal is sent from the mirror storage 
server to the local storage server. For example, the second heartbeat signal is 
second heartbeat signal 408. 

In an embodiment, step 1208 comprises the step of storing data of the 
received mirror storage request in a mirror storage device corresponding to a 
primary storage device. For example, the data of the mirror request is stored in 
a storage device/LUN of secondary storage 1 12 for the purposes of mirroring the 
data stored in a storage device/LUN of primary storage 110. In an embodiment, 
flowchart 1200 comprises the further step of sending a response across the 
established at least one connection from the mirror storage server to the local 
storage server, wherein the response indicates whether the storing of data of the 
received mirror storage request step was successful. 

FIG. 12B provides additional exemplary steps for flowchart 1200 of FIG. 

12A: 

In step 1214, an interruption is detected in the second heartbeat signal at 
the local storage server. 

In step 1216, the established at least one connection is closed. For 
example, in an embodiment, local storage server 1 06 and/or mirror storage server 
108 closes first TCP connection m 612 when it detects an interruption in second 
heartbeat signal 408.. 

In step 1218, mirror storage requests that result from primary storage 
requests that are received during the detected interruption are queued. For 
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example, in an embodiment, the mirror storage requests are sent as journal 
storage requests 1 1 10 to mirror storage request journal 1 104 for queuing. In an 
embodiment, journal storage requests 1 1 10 are stored in mirror storage request 
journal 1 104 on a per LUN basis, in LUN journals 1 106, 1 108, etc. 

FIG. 12C provides additional exemplary steps for flowchart 1200 of FIG. 

12A: 

In step 1220, the second heartbeat signal is received at the local storage 
server after the detected interruption of the second heartbeat signal. The local 
storage server 106 may receive second heartbeat signal 408 once the network 
interruption goes away, or is repaired. 

In step 1222, the closed at least one connection is re-established between 
the local storage server and the mirror storage server. For example, in an 
embodiment, first TCP connection m 612 is re-established between local storage 
server 106 and mirror server 108. 

In an embodiment, the flowchart of FIG. 12C further comprises the step 
of sending the queued mirror storage requests across the re-established at least 
one connection after re-establishing step 1222. For example, in an embodiment, 
re-synchronization threads 1102 are generated by heartbeat thread generator 
module 206. The re-synchronization threads 1 102 read the journaled mirror 
requests from mirror storage request journal 1104, and send them to mirror 
storage server 108 via first TCP connection 402. 

In an embodiment, step 1214 comprises the step of detecting an 
interruption in the second heartbeat signal at the local storage server that has a 
duration longer than a first predetermined amount of time. 

FIG. 12D provides exemplary steps for step 1222 of FIG. 12C. In an 
embodiment, step 1222 comprises the following steps: 

In step 1224, the second heartbeat signal is monitored for a probationary 
interval of time. As described above, second heartbeat signal 408 may be 
monitored for a period of time to determine whether it is stable. 
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In step 1226, the closed at least one connection between the local storage 
server and the mirror storage server is re-established only if no interruptions in 
the second heartbeat signal are detected during monitoring step 1224. As 
described above, first TCP connection m 612 may be re-established once the 
network connection has stabilized between local storage server 106 and mirror 
storage server 108. 

FIG. 12E provides exemplary steps for step 1202 of FIG. 12 A. In an 
embodiment, step 1202 comprises the following steps: 

In step 1228, n connections are established between the local storage 
server and the mirror storage server, wherein each of the n connections is between 
one of n worker threads in the local storage server and one of n connection 
threads in the mirror storage server, wherein n *1. In an embodiment, the n 
established connections on the local storage server and the n established 
connections are TCP connections. For example, n first TCP connections 402 may 
be established between n worker threads 208a and connection threads 210b. 

In step 1 230, a local connection array of n elements is stored on the local 
storage server, wherein each element of the local connection array corresponds 
to one of the n local worker thread that operates on the local storage server. For 
example, in an embodiment, the local connection array of n elements is 
connection array 216a stored in local storage server 106. Each element of 
connection array 216a corresponds to one of the n local worker threads 208a. 

In step 1232, a mirror connection array of n elements is stored on the 
mirror storage server, wherein each element of the mirror connection array 
corresponds to one of the n connection threads on the mirror storage server. For 
example, in an embodiment, the mirror connection array of n elements is 
connection array 216b stored in mirror storage server 108. Each element of 
connection array 216b corresponds to one of the n mirror worker threads 208b. 

In an embodiment, step 1230 comprises the step of storing a local 
connection array of h elements, wherein each element comprises a socket and a 
timestamp. For example, in an embodiment, each element of elements 410a 

SKGF Ref. No. 1942.0040000 



-29- 



comprises the socket parameter and timestamp parameter. In an embodiment, 
step 1202 comprises the step of establishing each of the n connections according 
to the socket parameter stored in the corresponding one of the n elements of the 
stored local connection array. For example, in an embodiment, the n connections 
are each established according to the socket parameter in a corresponding one of 
the n elements 410a of connection array 216a. 

In an embodiment, step 1232 comprises the step of storing a mirror 
connection array of n elements, wherein each element comprises a socket 
parameter and a timestamp parameter. For example, in an embodiment, each 
element of elements 410b comprises the socket parameter and timestamp 
parameter. 

FIG. 12F provides additional exemplary steps for flowchart 1200 of FIG. 

12 A: 

In step 1234, a mirror heartbeat sender thread and a mirror heartbeat 
receiver thread are established in the mirror storage server. For example, the 
mirror heartbeat sender thread and mirror heartbeat receiver thread are heartbeat 
sender thread 212b and heartbeat receiver thread 214b, respectively. 

In step 1236, a local heartbeat sender thread and a local heartbeat receiver 
thread are established in the local storage server. For example, the local heartbeat 
sender thread and local heartbeat receiver thread are local heartbeat sender thread 
212a and local heartbeat receiver thread 214a, respectively. In a further 
embodiment, step 1218 further comprises the step of sending the first heartbeat 
signal from the local heartbeat sender thread to the mirror heartbeat receiver 
thread. In a further embodiment, step 1220 further comprises the step of sending 
the second heartbeat signal from the mirror heartbeat sender thread to the local 
heartbeat receiver thread on the local storage server. 

In an embodiment, step 1234 and step 1236 each comprise the step of 
sending a message at time intervals of a second predetermined amount of time. 
In an embodiment, step 1234 and step 1236 each comprises the step of sending 
a user datagram protocol (UDP) message at time intervals of the second 
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predetermined amount of time. For example, in an embodiment, the second 
predetermined amount of time is 1.5 seconds. In alternative embodiments, the 
second predetermined time amount may be any number. 

In an embodiment, step 1210 comprises the step of updating the 
timestamp parameter of each of the n elements of the mirror connection array 
whenever the message on the first heartbeat signal is received by the mirror 
heartbeat receiver thread. For example, in an embodiment, each element of 
connection array 216b is updated when the UDP message is received on first 
heartbeat signal 406. 

In an embodiment, step 1212 comprises the step of updating the 
timestamp parameter of each of the n elements of the local connection array 
whenever the message on the second heartbeat signal is received by the local 
heartbeat receiver thread. For example, in an embodiment, each element of 
connection array 216a is updated when the UDP message is received on second 
heartbeat signal 408. 

In an embodiment, step 1214 comprises the step of indicating in one of the 
n elements of the mirror connection array that the corresponding one of the 
established n connections is closed if the timestamp parameter of the one of the 
n elements is older than the first predetermined amount of time. For example, in 
an embodiment, one of the n established connections of second TCP connections 
404 closed on mirror storage server 108 if the corresponding timestamp is older 
than 4.0 seconds. In alternative embodiments, the first predetermined amount of 
time may be any number. One of the n established connections on the mirror 
storage server may be closed by placing a closed entry into the timestamp or into 
a status parameter of the corresponding element, for example. 

FIG. 12G provides exemplary steps for step 1216 of FIG. 12B. In an 
embodiment, step 1216 comprises the following steps: 

In step 1238, one of the n connection threads on the mirror storage server 
is timed out if a request on the corresponding one of the established n connections 
has not arrived in a third predetermined amount of time. For example, in an 
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embodiment, the mirror connection thread m 704 times out if a mirror storage 
request is not received on first TCP connection m 612. 

In step 1240, the timed out connection thread closes and exits if the 
corresponding one of the n elements in the mirror connection array is indicated 
to be closed. For example, mirror connection thread m 704 closes and exits if 
element m 706 is indicated to be closed. 

FIG. 12H provides additional exemplary steps for flowchart 1200 of FIG. 

12A: 

In step 1242, a first message is received on the first heartbeat signal after 
an interruption of the first heartbeat signal. For example, a message is received 
on first heartbeat signal 406 after a period of interruption due to local storage 
server 1 06 going down, a break in the network, or other type of server or network 
interruption. Once the first message is received, a timer may be started 
corresponding to a probationary interval. If an interruption in messages received 
on the first heartbeat signal occurs during the probationary interval, step 1242 
may be repeated. 

In step 1244,* the n connections are re-established on the local storage 
server to the corresponding connection threads on the mirror storage server. 

In an embodiment, step 1220 comprises the step of receiving a first 
message on the second heartbeat signal after an interruption of the second 
heartbeat signal. For example, a message may be received on second heartbeat 
signal 408 after a period of interruption due to mirror storage server 108 going 
down, a break in the network, or other type of server or network interruption. 

FIG. 121 provides additional exemplary steps for flowchart 1200 of FIG. 
12 A, wherein the local storage server operates in an asynchronous mirror mode. 
For example, in an embodiment, local storage server 106 may operate in an 
asynchronous mirror mode, where local storage server 106 sends responses 
regarding attempt to store data in primary storage to hosts before receiving mirror 
storage responses from mirror storage server 108. 
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In step 1 246, the primary storage request is processed. For example, data 
associated with the primary storage request is stored in primary storage 110. 

In step 1248, the results of the processed primary storage request are sent 
to the network host. For example, an indication of whether the data was stored 
in primary storage 110 successfully is sent to the host in local servers 102. 

FIG. 12J provides additional exemplary steps for flowchart 1200 of FIG. 
12A, wherein the local storage server is operating in a synchronous mirror mode. 
For example, in an embodiment, local storage server 106 may operate in a 
synchronous mirror mode, where local storage server 106 waits for a response 
from mirror storage server 1 08 before sending a primary storage attempt response 
to the host. 

In step 1250, the primary storage request is processed. For example, data 
associated with the storage request is stored in primary storage 110. 

In step 1252, a response corresponding to the sent mirror storage request 
from the mirror storage server is waited for. For example, local storage server 
106 waits for an indication from mirror storage server 108 whether the mirror 
storage request was successful. 

In step 1254, the results of the processed primary storage request are sent 
to the network host after the response is received from the mirror storage server. 
For example, an indication of whether the data was stored in primary storage 1 10 
successfully is sent to the host in local servers 1 02. An indication of whether the 
mirror storage request was successful may also be sent to the host. 

In an embodiment, the process of flowchart 1200 of FIG. 12A comprises 
the further step of determining whether a LUN related to the received primary 
storage request is designated to be mirrored. For example, some LUNs in primary 
storage 110 may be mirrored in secondary storage 112, while other LUNs in 
primary storage may not be mirrored. Hence, this step determines whether a 
mirror storage request may be appropriate for a particular LUN. 

FIG. 13A shows a flowchart 1300 providing operational steps of an 
example bi-directional mirroring embodiment of the present invention. FIGS. 
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13B-J show operational steps of further embodiments of the present invention. 
The steps of FIGS. 13A-J may be implemented in hardware, firmware, software, 
or a combination thereof. For instance, the steps of FIGS. 13A-J may be 
implemented by local storage server 106 and mirror storage server 108. 
Furthermore, the steps of FIGS. 13 A- J do not necessarily have to occur in the 
order shown, as will be apparent to persons skilled in the relevant art(s) based on 
the teachings herein. Other structural embodiments will be apparent to persons 
skilled in the relevant art(s) based on the discussion contained herein. These 
steps are described in detail below. 

The process begins with step 1302. In step 1302, a first connection 
between a local storage server and a remote storage server is established. For 
example, in an embodiment, the first connection is included in n first TCP 
connections 402, between local storage server 1 06 and mirror storage server 1 08. 

In step 1 304, a second connection between the local storage server and the 
remote storage server is established. For example, in an embodiment, the second 
connection is included in n second TCP connections 404, between local storage 
server 106 and mirror storage server 108. 

In step 1306, a first local storage request from a first network host is 
received at the local storage server. For example, in an embodiment, the first 
local storage request is received from local servers 102 as storage request 608. 

In step 1308, a first local mirror storage request is sent from the local 
storage server across the first connection, wherein the first local mirror storage 
request corresponds to the first received local storage request. For example, in 
an embodiment, the first local mirror storage request is sent by local worker 
thread m 602 on first TCP connection m 612 from local storage server 106. 

In step 1 3 1 0, the first local mirror storage request is received at the remote 
storage server. For example, in an embodiment, the first local mirror storage 
request is received by mirror connection thread m 704 in mirror storage server 
108. 
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In step 1312, data received in the first local mirror storage request is 
stored in at least one remote storage device coupled to the remote storage server. 
For example, the data of the first local mirror storage request is stored in a storage 
device/LUN of secondary storage 112. 

In step 1314, a first remote storage request is received from a second 
network host at the remote storage server. For example, in an embodiment, the 
first remote storage request is received from mirror servers 104. 

In step 1 3 1 6, a first remote mirror storage request is sent from the remote 
storage server across the second connection, wherein the first remote mirror 
storage request corresponds to the received first remote storage request. For 
example, in an embodiment, the first remote mirror storage request is sent by a 
remote worker thread on a second TCP connection from mirror storage server 
108. 

In step 1 3 1 8, the first remote mirror storage request is received at the local 
storage server. For example, in an embodiment, the first remote mirror storage 
request is received by a local connection thread in local storage server 106. 

In step 1320, data received in the first remote mirror storage request is 
stored in at least one local storage device coupled to the local storage server. For 
example, the data of the first remote mirror storage request is stored in a storage 
device/LUN of primary storage 1 10. 

FIG. 1 3B provides additional exemplary steps for flowchart 1 300 of FIG. 

13A: 

In step 1322, a first heartbeat signal is sent from the local storage server 
to the mirror storage server. For example, the first heartbeat signal is first 
heartbeat signal 406. 

In step 1324, a second heartbeat signal is sent from the remote storage 
server to the local storage server. For example, the second heartbeat signal is 
second heartbeat signal 408. 

FIG. 13C provides additional exemplary steps for flowchart 1300 of FIG. 

13 A: 
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In step 1326, an interruption is detected in the second heartbeat signal at 
the local storage server. 

In step 1328, the established first connection is closed. For example, in 
an embodiment, local storage server 106 and/or mirror storage server 108 closes 
first TCP connection m 612 when it detects an interruption in second heartbeat 
signal 408. 

In step 1 330, at least a second local storage request is received at the local 
storage server. For example, local servers 102 continues to send storage requests 
to local storage server 106 

In step 1332, an at least a second local mirror storage request is queued 
at the local storage server, wherein the at least a second local mirror storage 
request corresponds to the received at least a second local storage request. For 
example, in an embodiment, the subsequent local mirror storage requests are sent 
as journal storage requests 1 1 10 to mirror storage request journal 1 104 in local 
storage server 106 for queuing. In an embodiment, journal storage requests 1110 
are stored in mirror storage request journal 1 104 on a per LUN basis, in LUN 
journals 1106, 1108, etc. 

FIG. 1 3D provides additional exemplary steps for flowchart 1 300 of FIG. 

13A: 

In step 1334, the second heartbeat signal is received at the local storage 
server after the detected interruption of the second heartbeat signal. Local storage 
server 1 06 may receive second heartbeat signal 408 once the network interruption 
goes away, or is repaired, for example. 

In step 1336, the closed first connection is re-established between the 
local storage server and the mirror storage server. 

In an embodiment, flowchart 1330 includes the further step of sending the 
queued at least a second local mirror storage requests across the re-established 
first connection after re-establishing step 1336. For example, in an embodiment, 
re-synchronization threads 1102 are generated by heartbeat thread generator 
module 206 in local storage server 106. The re-synchronization threads 1102 
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read the journaled mirror requests from mirror storage request journal 1 104, and 
send them to mirror storage server 108 via n first TCP connections 402. 

FIG. 13E provides additional exemplary steps for flowchart 1300 of FIG. 

13A: 

In step 1338, an interruption in the first heartbeat signal is detected at the 
remote storage server. 

In step 1340, the established second connection is closed. For example, 
in an embodiment, local storage server 106 and/or mirror storage server 108 
closes the second TCP connection when it detects an interruption in first heartbeat 
signal 406. 

In step 1342, at least a second remote storage request is received at the 
remote storage server. For example, mirror servers 1 04 continues to send storage 
requests to mirror storage server 108 

In step 1 344, an at least a second remote mirror storage request is queued 
at the remote storage server, wherein the at least a second remote mirror storage 
request corresponds to the received at least a second remote storage request. For 
example, in an embodiment, the subsequent remote mirror storage requests are 
sent as journal storage requests 1 1 10 to a mirror storage request journal 1 104 in 
mirror storage server 108 for queuing. In an embodiment, journal storage 
requests 1 1 10 are stored in mirror storage request journal 1 104 on a per LUN 
basis, in LUN journals 1 106, 1 108, etc. 

FIG. 13F provides additional exemplary steps for flowchart 1300 of FIG. 

13A: 

In step 1346, the first heartbeat signal is received at the remote storage 
server after the detected interruption of the first heartbeat signal. Mirror storage 
server 108 may receive first heartbeat signal 406 once the network interruption 
goes away, or is repaired. 

In step 1 348, the closed second connection is re-established between the 
local storage server and the mirror storage server. 
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In an embodiment, flowchart 1330 includes the further step of sending the 
queued at least a second remote mirror storage request across the re-established 
second connection after re-establishing step 1348. For example, in an 
embodiment, re-synchronization threads 1 102 are generated by heartbeat thread 
generator module 206 in mirror storage server 108. The re-synchronization 
threads 1102 read the journaled mirror requests from mirror storage request 
journal 1104, and send them to local storage server 106 via n second TCP 
connections 404. 

The steps shown and/or described above in regards to flowchart 1 200 and 
FIGS. 1 2A-J are applicable to the bi-directional mirroring described in flowchart 
1300, FIGS. 13A-F, and surrounding text. 

For illustrative purposes, the operation of the invention is represented 
above by flowcharts, such as flowchart 1200 in FIG. 12A and flowchart 1300 in 
FIG. 13 A. It should be understood, however, that the use of flowcharts is for 
illustrative purposes only, and is not limiting. For example, the invention is not 
limited to the operational embodiment(s) represented by the flowcharts. Instead, 
alternative operational embodiments will be apparent to persons skilled in the 
relevant art(s) based on the discussion contained herein. Also, the use of 
flowcharts should not be interpreted as limiting the invention to discrete or digital 
operation. In practice, as will be appreciated by persons skilled in the relevant 
art(s) based on the herein discussion, the invention can be achieved via discrete 
or continuous operation, or a combination thereof. Further, the flow of control 
represented by the flowcharts is provided for illustrative purposes only. Steps 
may occur in a different order than shown. Furthermore, as will be appreciated 
by persons skilled in the relevant art(s), other operational control flows are within 
the scope and spirit of the present invention. 
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Example Storage Area Network Environment 

In a preferred embodiment, the present invention is applicable to storage 
area networks. As discussed above, a storage area network (SAN) is a high-speed 
sub-network of shared storage devices. A SAN operates to provide access to the 
shared storage devices for all servers on a local area network (LAN), wide area 
network (WAN), or other network coupled to the SAN. 

It is noted that SAN attached storage (S AS) elements can connect directly 
to the SAN, and provide file, database, block, or other types of data access 
services. SAS elements that provide such file access services are commonly 
called Network Attached Storage, or NAS devices. NAS devices can be coupled 
to the SAN, either directly or through their own network configuration. A SAN 
configuration potentially provides an entire pool of available storage to each 
network server, eliminating the conventional dedicated connection between server 
and disk. Furthermore, because a server's mass data storage requirements are 
fulfilled by the SAN, the server's processing power is largely conserved for the 
handling of applications rather than the handling of data requests. For example, 
primary storage 110 and secondary storage 1 12 may be configured as SANs. 

FIG. 8 illustrates an example data communication network 800, according 
to an embodiment of the present invention. Network 800 includes a variety of 
devices which support communication between many different entities, including 
businesses, universities, individuals, government, and financial institutions. As 
shown in FIG. 8, a communication network, or combination of networks, 
interconnects the elements of network 800. Network 800 supports many different 
types of communication links implemented in a variety of architectures. 

Network 800 may be considered to be an example of a storage area 
network that is applicable to the present invention. Network 800 comprises a 
pool of storage devices, including disk arrays 820, 822, 824, 828, 830, and 832. 
Network 800 provides access to this pool of storage devices to hosts/servers 
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comprised by or coupled to network 800. Network 800 may be configured as 
point-to-point, arbitrated loop, or fabric topologies, or combinations thereof. 

Network 800 comprises a switch 812. Switches, such as switch 812, 
typically filter and forward packets between LAN segments. Switch 8 1 2 may be 
an Ethernet switch, fast-Ethernet switch, or another type of switching device 
known to persons skilled in the relevant art(s). In other examples, switch 812 
may be replaced by a router or a hub. A router generally moves data from one 
local segment to another, and to the telecommunications carrier, such as AT & T 
or WorldCom, for remote sites. A hub is a common connection point for devices 
in a network. Suitable hubs include passive hubs, intelligent hubs, and switching 
hubs, and other hub types known to persons skilled in the relevant art(s). 

Various types of terminal equipment and devices may interface with 
network 800. For example, a personal computer 802, a workstation 804, a printer 
806, a laptop mobile device 808, and a handheld mobile device 8 1 0 interface with 
network 800 via switch 812. Further types of terminal equipment and devices 
that may interface with network 800 may include local area network (LAN) 
connections (e.g., other switches, routers, or hubs), personal computers with 
modems, content servers of multi-media, audio, video, and other information, 
pocket organizers, Personal Data Assistants (PDAs), cellular phones, Wireless 
Application Protocol (WAP) phones, and set-top boxes. These and additional 
types of terminal equipment and devices, and ways to interface them with 
network 800, will be known by persons skilled in the relevant art(s) from the 
teachings herein. 

Network 800 includes one or more hosts or servers. For example, 
network 800 comprises server 8 1 4 and server 8 1 6. Servers 8 1 4 and 8 1 6 provide 
devices 802, 804, 806, 808, and 810 with network resources via switch 812. 
Servers 814 and 816 are typically computer systems that process end-user 
requests for data and/or applications. In one example configuration, servers 814 
and 816 provide redundant services. In another example configuration, server 
814 and server 816 provide different services and thus share the processing load 
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needed to serve the requirements of devices 802, 804, 806, 808, and 810. In 
further example configurations, one or both of servers 8 1 4 and 8 1 6 are connected 
to the Internet, and thus server 8 1 4 and/or server 8 1 6 may provide Internet access 
to network 800. One or both of servers 8 1 4 and 8 1 6 may be Windows NT servers 
or UNIX servers, or other servers known to persons skilled in the relevant art(s). 

A SAN appliance or device as described elsewhere herein may be inserted 
into network 800, according to embodiments of the present invention. For 
example, a SAN appliance 818 may to implemented to provide the required 
connectivity between the storage device networking (disk arrays 820, 822, 824, 
828, 830, and 832) and hosts and servers 814 and 816, and to provide the 
additional functionality of the storage mirror of the present invention described 
elsewhere herein. 

Network 800 includes a hub 826. Hub 826 is connected to disk arrays 
828, 830, and 832. Preferably, hub 826 is a fibre channel hub or other device 
used to allow access to data stored on connected storage devices, such as disk 
arrays 828, 830, and 832. Further fibre channel hubs may be cascaded with hub 
826 to allow for expansion of the SAN, with additional storage devices, servers, 
and other devices. In an example configuration for network 800, hub 826 is an 
arbitrated loop hub. In such an example, disk arrays 828, 830, and 832 are 
organized in a ring or loop topology, which is collapsed into a physical star 
configuration by hub 826. Hub 826 allows the loop to circumvent a disabled or 
disconnected device while maintaining operation. 

Network 800 may include one or more switches in addition to switch 812 
that interface with storage devices. For example, a fibre channel switch or other 
high-speed device may be used to allow servers 8 1 4 and 8 1 6 access to data stored 
on connected storage devices, such as disk arrays 820, 822, and 824, via appliance 
818. Fibre channel switches may be cascaded to allow for the expansion of the 
SAN, with additional storage devices, servers, and other devices. 

Disk arrays 820, 822, 824, 828, 830, and 832 are storage devices 
providing data and application resources to servers 814 and 816 through 
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appliance 818 and hub 826. As shown in FIG. 8, the storage of network 800 is 
principally accessed by servers 814 and 816 through appliance 818. The storage 
devices may be fibre channel-ready devices, or SCSI (Small Computer Systems 
Interface) compatible devices, for example. Fibre channel-to-SCSI bridges may 
be used to allow SCSI devices to interface with fibre channel hubs and switches, 
and other fibre channel-ready devices. One or more of disk arrays 820, 822, 824, 
828, 830, and 832 may instead be alternative types of storage devices, including 
tape systems, JBOD3 (Just a Bunch Of Disks), floppy disk drives, optical disk 
drives, and other related storage drive types. 

The topology or architecture of network 800 will depend on the 
requirements of the particular application, and on the advantages offered by the 
chosen topology. One or more hubs 826, one or more switches, and/or one or 
more appliances 818 may be interconnected in any number of combinations to 
increase network capacity. Disk arrays 820, 822, 824, 828, 830, and 832, or 
fewer or more disk arrays as required, may be coupled to network 800 via these 
hubs 826, switches, and appliances 818. 

Communication over a communication network, such as shown in 
network 800 of FIG. 8, is carried out through different layers. FIG. 9 shows a 
simplified five-layered communication model, based on Open System 
Interconnection (OSI) reference model. As shown in FIG. 9, this model includes 
an application layer 908, a transport layer 910, a network layer 920, a data link 
layer 930, and a physical layer 940. As would be apparent to persons skilled in 
the relevant art(s), any number of different layers and network protocols may be 
used as required by a particular application. 

Application layer 908 provides functionality for the different tools and 
information services which are used to access information over the 
communications network. Example tools used to access information over a 
network include, but are not limited to Telnet log-in service 901, IRC chat 902, 
Web service 903, and SMTP (Simple Mail Transfer Protocol) electronic mail 
service 906. Web service 903 allows access to HTTP documents 904, and FTP 
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(File Transfer Protocol) and Gopher files 905. Secure Socket Layer (SSL) is an 
optional protocol used to encrypt communications between a Web browser and 
Web server. 

Transport layer 910 provides transmission control functionality using 
protocols, such as TCP, UDP, SPX, and others, that add information for 
acknowledgments that blocks of the file had been received. 

Network layer 920 provides routing functionality by adding network 
addressing information using protocols such as IP, IPX, and others, that enable 
data transfer over the network. 

Data link layer 930 provides information about the type of media on 
which the data was originated, such as Ethernet, token ring, or fiber distributed 
data interface (FDDI), and others. 

Physical layer 940 provides encoding to place the data on the physical 
transport, such as twisted pair wire, copper wire, fiber optic cable, coaxial cable, 
and others. 

Description of this example environment in these terms is provided for 
convenience only. It is not intended that the invention be limited to application 
in this example environment. In fact, after reading the description herein, it will 
become apparent to persons skilled in the relevant art(s) how to implement the 
invention in alternative environments. Further details on designing, configuring, 
and operating storage area networks are provided in Tom Clark, "Designing 
Storage Area Networks: A Practical Reference for Implementing Fibre Channel 
SANs" (1999), which is incorporated herein by reference in its entirety. 

SAN Appliance Embodiments 

The present invention may be implemented in and operated from a SAN 
appliance that interfaces between the hosts and the storage subsystems comprising 
the SAN. The present invention is completely host (operating system) 
independent and storage system independent. As opposed to host-based remote 
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mirroring solutions, data storage mirroring according to the present invention 
does not require special host software. As opposed to storage array based 
mirroring solutions, data storage mirroring according to the present invention is 
not tied to a specific storage vendor and operates with any type of storage, 
including fibre channel and SCSI. 

As described above, the present invention may be implemented with a 
pair of SAN appliances, such as the SANLink™ appliance. For instance, local 
storage server 106 and mirror storage server 108 may each be SAN appliances. 
In embodiments, a web-based user interface may be used to centrally manage 
mirroring for the entire SAN. 

Example Computer System 

An example of a computer system 1040 is shown in FIG. 10. The 
computer system 1040 represents any single or multi-processor computer. In 
conjunction, single-threaded and multi-threaded applications can be used. 
Unified or distributed memory systems can be used. Computer system 1040, or 
portions thereof, may be used to implement the present invention. For example, 
each of the servers of the storage mirror of the present invention may comprise 
software running on a computer system such as computer system 1040. 

In one example, the storage mirror of the present invention is 
implemented in a multi-platform (platform independent) programming language 
such as JAVA 1.1, programming language/structured query language (PL/SQL), 
hyper-text mark-up language (HTML), practical extraction report language 
(PERL), common gateway interface/structured query language (CGI/SQL) or the 
like. Java™- enabled and JavaScript™- enabled browsers are used, such as, 
Netscape™, HotJava™, and Microsoft™ Explorer™ browsers. Active content 
Web pages can be used. Such active content Web pages can include Java™ 
applets or ActiveX™ controls, or any other active content technology developed 
now or in the future. The present invention, however, is not intended to be 
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limited to Java™, JavaScript™, or their enabled browsers, and can be 
implemented in any programming language and browser, developed now or in the 
future, as would be apparent to a person skilled in the art given this description. 

In another example, the storage mirror of the present invention, including 
worker thread generator module 202, connection thread generator module 204, 
and heartbeat thread generator module 206, may be implemented using a 
high-level programming language (e.g., C++) and applications written for the 
Microsoft Windows™ environment. It will be apparent to persons skilled in the 
relevant art(s) how to implement the invention in alternative embodiments from 
the teachings herein. 

Computer system 1 040 includes one or more processors, such as processor 
1 044. One or more processors 1 044 can execute software implementing routines 
described above, such as shown in flowchart 1200. Each processor 1044 is 
connected to a communication infrastructure 1042 (e.g., a communications bus, 
cross-bar, or network). Various software embodiments are described in terms of 
this exemplary computer system. After reading this description, it will become 
apparent to a person skilled in the relevant art how to implement the invention 
using other computer systems and/or computer architectures. For example, n 
worker threads 208, n connection threads 210, heartbeat sender thread 212, 
heartbeat receiver thread 214, and re-synchronization threads 1102 may be 
executed on one or more of processor 1044. 

Computer system 1 040 can include a display interface 1 002 that forwards 
graphics, text, and other data from the communication infrastructure 1042 (or 
from a frame buffer not shown) for display on the display unit 1030. 

Computer system 1040 also includes a main memory 1046, preferably 
random access memory (RAM), and can also include a secondary memory 1048. 
The secondary memory 1048 can include, for example, a hard disk drive 1050 
and/or a removable storage drive 1052, representing a floppy disk drive, a 
magnetic tape drive, an optical disk drive, etc. The removable storage drive 1 052 
reads from and/or writes to a removable storage unit 1054 in a well known 
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manner. Removable storage unit 1054 represents a floppy disk, magnetic tape, 
optical disk, etc., which is read by and written to by removable storage drive 
1052. As will be appreciated, the removable storage unit 1054 includes a 
computer usable storage medium having stored therein computer software and/or 
data. 

In alternative embodiments, secondary memory 1048 may include other 
similar means for allowing computer programs or other instructions to be loaded 
into computer system 1040. Such means can include, for example, a removable 
storage unit 1062 and an interface 1060. Examples can include a program 
cartridge and cartridge interface (such as that found in video game devices), a 
removable memory chip (such as an EPROM, or PROM) and associated socket, 
and other removable storage units 1 062 and interfaces 1 060 which allow software 
and data to be transferred from the removable storage unit 1062 to computer 
system 1040. 

Computer system 1 040 can also include a communications interface 1 064. 
Communications interface 1064 allows software and data to be transferred 
between computer system 1040 and external devices via communications path 
1066. Examples of communications interface 1064 can include a modem, a 
network interface (such as Ethernet card), a communications port, interfaces 
described above, etc. Software and data transferred via communications interface 
1064 are in the form of signals which can be electronic, electromagnetic, optical 
or other signals capable of being received by communications interface 1 064, via 
communications path 1066. Note that communications interface 1064 provides 
a means by which computer system 1040 can interface to a network such as the 
Internet. 

The present invention can be implemented using software running (that 
is, executing) in an environment similar to that described above with respect to 
FIG. 8. In this document, the term "computer program product 1 ' is used to 
generally refer to removable storage unit 1054, a hard disk installed in hard disk 
drive 1 050, or a carrier wave carrying software over a communication path 1 066 
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(wireless link or cable) to communication interface 1064. A computer useable 
medium can include magnetic media, optical media, or other recordable media, 
or media that transmits a carrier wave or other signal. These computer program 
products are means for providing software to computer system 1040. 

Computer programs (also called computer control logic) are stored in 
main memory 1046 and/or secondary memory 1048. Computer programs can 
also be received via communications interface 1064. Such computer programs, 
when executed, enable the computer system 1040 to perform the features of the 
present invention as discussed herein. In particular, the computer programs, when 
executed, enable the processor 1044 to perform features of the present invention. 
Accordingly, such computer programs represent controllers of the computer 
system 1040. 

The present invention can be implemented as control logic in software, 
firmware, hardware or any combination thereof. In an embodiment where the 
invention is implemented using software, the software may be stored in a 
computer program product and loaded into computer system 1040 using 
removable storage drive 1052, hard disk drive 1050, or interface 1060. 
Alternatively, the computer program product may be downloaded to computer 
system 1 040 over communications path 1 066. The control logic (software), when 
executed by the one or more processors 1044, causes the processor(s) 1044 to 
perform functions of the invention as described herein. 

In another embodiment, the invention is implemented primarily in 
firmware and/or hardware using, for example, hardware components such as 
application specific integrated circuits (ASICs). Implementation of a hardware 
state machine so as to perform the functions described herein will be apparent to 
persons skilled in the relevant art(s) from the teachings herein. 
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Conclusion 

While various embodiments of the present invention have been described 
above, it should be understood that they have been presented by way of example 
only, and not limitation. It will be apparent to persons skilled in the relevant art 
that various changes in form and detail can be made therein without departing 
from the spirit and scope of the invention. Thus, the breadth and scope of the 
present invention should not be limited by any of the above-described exemplary 
embodiments, but should be defined only in accordance with the following claims 
and their equivalents. 
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